Loving County
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach
Web-scale visual entity recognition, the task of associating images with their corresponding entities within vast knowledge bases like Wikipedia, presents significant challenges due to the lack of clean, large-scale training data. In this paper, we propose a novel methodology to curate such a dataset, leveraging a multimodal large language model (LLM) for label verification, metadata generation, and rationale explanation. Instead of relying on the multimodal LLM to directly annotate data, which we found to be suboptimal, we prompt it to reason about potential candidate entity labels by accessing additional contextually relevant information (such as Wikipedia), resulting in more accurate annotations. We further use the multimodal LLM to enrich the dataset by generating question-answer pairs and a grounded finegrained textual description (referred to as "rationale") that explains the connection between images and their assigned entities. Experiments demonstrate that models trained on this automatically curated data achieve state-of-the-art performance on web-scale visual entity recognition tasks (e.g.
Semi-Self-Supervised Domain Adaptation: Developing Deep Learning Models with Limited Annotated Data for Wheat Head Segmentation
Ghanbari, Alireza, Shirdel, Gholamhassan, Maleki, Farhad
Precision agriculture involves the application of advanced technologies to improve agricultural productivity, efficiency, and profitability while minimizing waste and environmental impact. Deep learning approaches enable automated decision-making for many visual tasks. However, in the agricultural domain, variability in growth stages and environmental conditions, such as weather and lighting, presents significant challenges to developing deep learning-based techniques that generalize across different conditions. The resource-intensive nature of creating extensive annotated datasets that capture these variabilities further hinders the widespread adoption of these approaches. To tackle these issues, we introduce a semi-self-supervised domain adaptation technique based on deep convolutional neural networks with a probabilistic diffusion process, requiring minimal manual data annotation. Using only three manually annotated images and a selection of video clips from wheat fields, we generated a large-scale computationally annotated dataset of image-mask pairs and a large dataset of unannotated images extracted from video frames. We developed a two-branch convolutional encoder-decoder model architecture that uses both synthesized image-mask pairs and unannotated images, enabling effective adaptation to real images. The proposed model achieved a Dice score of 80.7\% on an internal test dataset and a Dice score of 64.8\% on an external test set, composed of images from five countries and spanning 18 domains, indicating its potential to develop generalizable solutions that could encourage the wider adoption of advanced technologies in agriculture.
Empirical Study of PEFT techniques for Winter Wheat Segmentation
Zahweh, Mohamad Hasan, Nasrallah, Hasan, Shukor, Mustafa, Faour, Ghaleb, Ghandour, Ali J.
Parameter Efficient Fine Tuning (PEFT) techniques have recently experienced significant growth and have been extensively employed to adapt large vision and language models to various domains, enabling satisfactory model performance with minimal computational needs. Despite these advances, more research has yet to delve into potential PEFT applications in real-life scenarios, particularly in the critical domains of remote sensing and crop monitoring. The diversity of climates across different regions and the need for comprehensive large-scale datasets have posed significant obstacles to accurately identify crop types across varying geographic locations and changing growing seasons. This study seeks to bridge this gap by comprehensively exploring the feasibility of cross-area and cross-year out-of-distribution generalization using the State-of-the-Art (SOTA) wheat crop monitoring model. The aim of this work is to explore PEFT approaches for crop monitoring. Specifically, we focus on adapting the SOTA TSViT model to address winter wheat field segmentation, a critical task for crop monitoring and food security. This adaptation process involves integrating different PEFT techniques, including BigFit, LoRA, Adaptformer, and prompt tuning. Using PEFT techniques, we achieved notable results comparable to those achieved using full fine-tuning methods while training only a mere 0.7% parameters of the whole TSViT architecture. The in-house labeled data-set, referred to as the Beqaa-Lebanon dataset, comprises high-quality annotated polygons for wheat and non-wheat classes with a total surface of 170 kmsq, over five consecutive years. Using Sentinel-2 images, our model achieved a 84% F1-score. We intend to publicly release the Lebanese winter wheat data set, code repository, and model weights.
Improving FHB Screening in Wheat Breeding Using an Efficient Transformer Model
Azad, Babak, Abdalla, Ahmed, Won, Kwanghee, Nafchi, Ali Mirzakhani
Fusarium head blight is a devastating disease that causes significant economic losses annually on small grains. Efficiency, accuracy, and timely detection of FHB in the resistance screening are critical for wheat and barley breeding programs. In recent years, various image processing techniques have been developed using supervised machine learning algorithms for the early detection of FHB. The state-of-the-art convolutional neural network-based methods, such as U-Net, employ a series of encoding blocks to create a local representation and a series of decoding blocks to capture the semantic relations. However, these methods are not often capable of long-range modeling dependencies inside the input data, and their ability to model multi-scale objects with significant variations in texture and shape is limited. Vision transformers as alternative architectures with innate global self-attention mechanisms for sequence-to-sequence prediction, due to insufficient low-level details, may also limit localization capabilities. To overcome these limitations, a new Context Bridge is proposed to integrate the local representation capability of the U-Net network in the transformer model. In addition, the standard attention mechanism of the original transformer is replaced with Efficient Self-attention, which is less complicated than other state-of-the-art methods. To train the proposed network, 12,000 wheat images from an FHB-inoculated wheat field at the SDSU research farm in Volga, SD, were captured. In addition to healthy and unhealthy plants, these images encompass various stages of the disease. A team of expert pathologists annotated the images for training and evaluating the developed model. As a result, the effectiveness of the transformer-based method for FHB-disease detection, through extensive experiments across typical tasks for plant image segmentation, is demonstrated.
Council Post: Artificial Intelligence And Precision Farming: The Dawn Of The Next Agricultural Revolution
Co-Founder and CTO of Prospera Technologies, leading the company's vision to transform the way food is grown using data science and AI. The human race has come a long way in our ability to produce food at scale. Historian and author Yuval Noah Harari refers to it in his book Sapiens as "an agricultural revolution," using wheat as an example. Ten thousand years ago, wheat was a wild grass that grew in a relatively small region in the Middle East. Today, wheat can be considered one of the most successful plants in history, according to the evolutionary criteria of survival and reproduction. In regions where wheat never existed, such as the Great Plains of North America, you can drive for hundreds of miles without seeing anything else but wheat fields.
Crop Height and Plot Estimation for Phenotyping from Unmanned Aerial Vehicles using 3D LiDAR
Dhami, Harnaik, Yu, Kevin, Xu, Tianshu, Zhu, Qian, Dhakal, Kshitiz, Friel, James, Li, Song, Tokekar, Pratap
We present techniques to measure crop heights using a 3D Light Detection and Ranging (LiDAR) sensor mounted on an Unmanned Aerial Vehicle (UAV). Knowing the height of plants is crucial to monitor their overall health and growth cycles, especially for high-throughput plant phenotyping. We present a methodology for extracting plant heights from 3D LiDAR point clouds, specifically focusing on plot-based phenotyping environments. We also present a toolchain that can be used to create phenotyping farms for use in Gazebo simulations. The tool creates a randomized farm with realistic 3D plant and terrain models. We conducted a series of simulations and hardware experiments in controlled and natural settings. Our algorithm was able to estimate the plant heights in a field with 112 plots with a root mean square error (RMSE) of 6.1 cm. This is the first such dataset for 3D LiDAR from an airborne robot over a wheat field. The developed simulation toolchain, algorithmic implementation, and datasets can be found on the GitHub repository located at https://github.com/hsd1121/PointCloudProcessing.
Why Video Game Creators Are Skipping the Bloodshed and Making 'Pacifist Games' Instead
A deer and a fawn are lost far from home, with no idea where they are or how they can get back to the woods. With the help of glowing antlers, curiosity and sheer will, the animals travel through wheat fields, abandoned subway systems and surreal sewers to make it back where they belong. This is Way to the Woods, a video game for PC and Xbox One due out in 2020 from an independent, 20-year-old developer named Anthony Tan. Following in the footsteps of similar so-called "pacifist" games like Journey, Firewatch and Night in the Woods, Way to the Woods is aimed at gamers looking for adult experiences that don't rely on violence to tell a compelling story. While President Donald Trump and others continue to suggest a connection between violent video games and mass shootings despite a lack of evidence, there has been a notable spike in games like Tan's that are more about exploration, story and design than racking up a body count.
Development of a Forecasting and Warning System on the Ecological Life-Cycle of Sunn Pest
Balaban, ฤฐsmail, Acun, Fatih, Arpalฤฑ, Onur Yiฤit, Murat, Furkan, Babaroฤlu, Numan Ertuฤrul, Akci, Emre, รulcu, Mehmet, รzkan, Mรผmtaz, Temizer, Selim
We provide a machine learning solution that replaces the traditional methods for deciding the pesticide application time of Sunn Pest. We correlate climate data with phases of Sunn Pest in its life-cycle and decide whether the fields should be sprayed. Our solution includes two groups of prediction models. The first group contains decision trees that predict migration time of Sunn Pest from winter quarters to wheat fields. The second group contains random forest models that predict the nymphal stage percentages of Sunn Pest which is a criterion for pesticide application. We trained our models on four years of climate data which was collected from Kir\c{s}ehir and Aksaray. The experiments show that our promised solution make correct predictions with high accuracies.
Subverting Our New Space Overlords
Complex financial information is hidden in plain sight all over the planet, according to James Crawford, CEO of Orbital Insight. The number of ships docked at a Malaysian port, even the color of a wheat field in western Nebraska, are actually signs, Crawford explained to me, visible indicators of economic activity, not just for a local region but for an entire global industry. Seen this way, mundane landscapes previously deemed unworthy of analysis can, in fact, be meticulously--and profitably--scrutinized. This newfound appreciation is not aesthetic, of course, but fiscal, as even the growing shadows of a Chinese construction site can be interpreted as valuable clues about the strength of the underlying economy. Crawford's company, Orbital Insight, is one of a new breed of market-research firms pioneering the use of high-resolution satellite imagery.